Modelling Pronunciation Variability with Hierarchical Word Networks

نویسندگان

  • Serguei Koval
  • Natalia Smirnova
  • Mikhail Khitrov
چکیده

In the paper a new method is suggested to effectively capture pronunciation variability in ASR tasks. Its basic principle consists in structuring word-related phonetic feature space. For each word in the system dictionary, given its “ideal” transcription, as a result of application of specially designed modification rules and constraints, a network of its phonetic realisations is generated (the so-called hierarchical word network – HWN). In contrast to “allophone networks” HWNs provide adequate covering of all phone modifications (including articulatory laxing, contextual accommodation, accidental substitutions etc.) and allow for various levels of precision in the phonetic representation of word pronunciation variability. In view of using this representation for ASR tasks adequate sophistication of the word model is proposed with the introduction of the so-called hierarchical matching functions (HMF).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Implicit Pronunciation Modelling in Asr

Modelling of pronunciation variability is an important part of the acoustic model of a speech recognition system. Good pronunciation models contribute to the robustness and portability of a speech recogniser. Usually pronunciation modelling is associated with the recognition lexicon which allows a direct control of HMM selection. However, in state-of-the-art systems the use of clustering techni...

متن کامل

Modelling pronunciation variations in spontaneous Mandarin speech

Pronunciation in spontaneous Mandarin speech tends to be much more variable than in read speech. In current recognition systems, pronunciation dictionaries usually only contain one standard pronunciation for each word, so that the amount of variability that can be modelled is very limited. Most recent research work for modelling variations in spontaneous speech focuses on the lexicon level, whi...

متن کامل

Pronunciation Modelling for Conversational Speech Recognition: a Status Report from Ws97

Accurately modelling pronunciation variability in conversational speech is an important component for automatic speech recognition. We describe some of the projects undertaken in this direction at WS97, the Fifth LVCSR Summer Workshop, held at Johns Hopkins University, Baltimore, in July-August, 1997. We first illustrate a use of hand-labelled phonetic transcriptions of a portion of the Switchb...

متن کامل

Rule-based Word Pronunciation Networks Generation for Mandarin Speech Recognition

Modeling pronunciation variation in spontaneous speech is very important for improving the recognition accuracy. One limitation of current recognition systems is their dictionaries for recognition only contain one standard pronunciation for each entry, so that the amount of variability that can be modeled is very limited. In this paper, we proposed to generate pronunciation networks based on ru...

متن کامل

Pronunciation Modeling for Large Vocabulary Speech Recognition by Arthur

The large pronunciation variability of words in conversational speech is one of the major causes of low accuracy for automatic speech recognition (ASR). Many pronunciation modeling approaches have been developed to address this problem. Some explicitly manipulate the pronunciation dictionary as well as the set of the units used to define the pronunciations of words. Others model the pronunciati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000